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will  also  agree  that  there  is  another  whole  class  of  cases,  in  which  we  are  concerned 
with  unique  events,  in  which  we  lack  statistical  knowledge,  and  for  which  we  must  turn 
to  subjective  probability  or  one  of  its  surrogates.  I  purpose  here  to  argue  against 
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"unique"  events  is  not  only  untenable,  but  seriously  misleading. 
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1.  statistics  and  unique  events. 

Almost  everyone  will  agree  that  when  our  background^ statistical 
knowledge  is  extensive  enough,  and  when  the  case  wi^  which  we  are 
concerned  is  a  "repeatable  event",  Chen  objective  probabilities  are 
appropriate,  and  these  are  the  probabilities  that  should  enter  into  Che 
computation  of  expectation  and  into  our  decision  theory.  A  great  many 
people  will  also  agree  Chat  there  is  another  whole  class  of  cases,  in 
which  we  are  concerned  with  unique  events,  in  which  we  lack  statistical 
knowledge,  and  for  which  we  must  turn  Co  subjective  probability  or  one  of 
its  surrogates.  I  propose  here  to  argue  against  this  distinction.  Of 
course  it  is  easy  enough  to  argue  this  way  in  a  purely  philosophical  vein: 
every  event  must  be  unique  —  it  has  its  own  spatio-temporal  locus;  and 
every  unique  event  must  belong  Co  some  class  of  events  about  which,  in 
principle,  we  could  have  statistical  knowledge.  But  this  is  not  my  point. 
My  point  is  that  from  a  down-to-earth  practical  point  of  view,  from  the 
point  of  view  that  seeks  to  compute  probabilities  and  expectations  for 
making  decisions,  the  distinction  between  'repeatable'  and  'unique'  events 
is  not  only  untenable,  but  seriously  misleading. 
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2.  kinds  of  cases. 

Let  us  consider  some  examples  of  these  alleged  distinctions. 

Consider  a  toss  of  a  coin.  There  is  a  classical  'repeatable'  event:  not 
only  can  we  toss  a  coin  over  and  over  again;  coins  have  been  tossed  over 
and  over  again,  and  in  the  experience  of  each  of  us  there  is  a  large  data¬ 
base  of  results  of  coin  tosses  (or  an  impressionistic  resume  of  such  a 
data  base).  And  we  have  physical  grounds  (i.e.,  grounds  stemming  from  the 
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laws  of  physics)  for  thinking  that  coins  land  heads  about  half  the  time. 
And  so  ve  can  regard  that  toss  of  that  coin  as  a  member  of  a  class  of 
tosses,  of  which  we  have  reason  to  believe  that  half  yield  heads. 
(Alternatively  we  might  regard  that  toss  as  a  kind  of  trial  that  has  a 
propensity  of  a  half  to  yield  heads.) 

Nov  of  course  a  particular  toss,  at  a  particular  time  and  place 
cannot  be  repeated.  We  all  know  that.  But  the  event  can  be  repeated  in 
'all  relevant  respects'.  We  don't  have  to  make  the  toss  at  the  same  time 
or  the  same  place;  we  don't  have  to  use  the  same  coin;  we  don't  have  to 
use  the  same  kind  of  coin;  we  don't  have  to  flip  it  in  any  particular  way. 

Consider  another  case.  Suppose  I  am  interested  in  whether  or  not  my 
friend  Sam  will  be  at  home  tonight  after  supper.  I  know  that  on  some 
week**nights  he  goes  to  the  movies,  but  that  he  never  goes  anywhere  else. 
But  this  doesn't  give  me  a  'repeatable  event'.  Whether  or  not  he  goes  to 
the  movies  depends  on  factors  that  I  can  specify.  Unlike  the  flip  of  the 
coin,  I  can  be  quite  sure  that  there  are  factors  that  influence  his 
behavior  —  and  I  can  even  specify  some  of  them.  Thus  I  might  know  that 
he  likes  Westerns  —  so  if  there  is  a  good  Western  in  town  he  will  be  more 
likely  to  be  at  the  movies.  On  the  other  hand,  I  might  know  that  Sam  is 
very  conscientious  about  studying  for  his  chemistry  examinations  —  so  if 
there  were  to  be  a  chemistry  examination  tomorrow,  Sam  would  most  likely 
be  at  home.  In  short,  this  seems  like  a  perfect  case  for  subjective 
probability.  But  if  I  have  known  Sam  for  a  long  time,  I  ^  have  a  basis 
for  knowing  how  often,  in  general,  he  goes  to  the  movies  on  week-nights. 
My  knowledge  is  neither  so  precise  nor  so  secure  as  my  knowledge  about  the 
coin,  but  it  is  surely  not  non-existent.  Similarly,  those  factors  I  just 
named  (Sam's  conscientiousness  about  chemistry;  his  liking  for  Westerns), 
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so  far  from  accentuating  the  contrast  with  the  well-known  coin,  create  a 
greater  similarity.  It  is  as  if  we  did  know  something  special  about 
tosses  of  coins  of  different  denominations:  quarters  tend  to  land  heads 
more  often  than  other  coins,  dimes  less  often. 

Of  course,  the  better  1  know  Sam,  the  more  precise  1  can  be  about 
this  probability.  But  this  just  means  that  I  know  more  about  the  relative 
frequency  with  which  he  goes  to  the  movies.  If  his  life  were  to  change  a 
lot,  I  would  regard  my  knowledge  of  this  frequency  as  suspect.  But  if  the 
coin  were  to  have  been  run  over  by  a  trolley-car,  I  would  regard  my 
knowledge  of  it  as  suspect. 

We  must  beware  of  allowing  Che  variety  of  our  knowledge  about  Sam  to 
serve  as  an  excuse  for  guessing  wildly.  Analogously,  if  we  were  to  have 
detailed  and  microscopic  data  concerning  the  coin  toss,  we  could  perhaps 
predict  with  a  better  than  50Z  success  rate.  This  possibility  should  not  be 
allowed  to  undermine  our  sensible  tendency  to  assign  a  probability  of  1/2  to 
the  occurrence  of  heads  on  the  specified  toss  when  we  lack  that  microscopic 
data. 

Finally,  there  are  some  circumstances  under  which  my  probability 
concerning  Sam's  being  at  home  is  just  as  exact  as  my  probabili  ty 
concerning  the  flip  of  a  coin.  For  example,  I  may  know  that  he  decided 
whether  or  not  to  go  to  the  movies  by  flipping  an  ordinary  coin.  Suppose 
in  addition  that  1  know  that  he  went  out  if  and  only  if  the  coin  landed 
heads.  Then  the  probability  that  Sam  went  to  the  movies  is  one-half. 

This  brings  out  an  important  point  that  I  shall  enshrine  as  an  axiom: 

A1  If  and  T^ are  known  to  have  the  same  truth-value  then 
they  have  the  same  probability. 

This  axiom  does  not  require  that  ^  and  ^be  equivalent  in  any  strong 
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sense;  ell  that  is  required  is  that  ve  know  that  they  have  the  same  truth 
value. 

This  axiom  already  undermines  the  argument  for  subjectivity  based  on 
uniqueness.  It  is  asked,  "How  can  you  find  an  objective  probability  for  the 
event  of  an  Indian  plutonium  plant  suffering  a  melt-down,  when  there  is  no 
class  of  instances  to  generalize  from:  there  is  only  one  Indian  plutonium 
plant,  its  design  is  unique,  etc."  The  long  answer  is  that  there  will  be  a 
melt-down  if  and  only  if  there  is  a  failure  of  the  cooling  system  or  a 
failure  of  the  rod  control  system.  In  turn  we  can  break  these  propositions 

into  equivalent  ones  until  we  have  quite  small  and  quite  general  statements 

about  which  we  ^  have  statistical  data.  (Gate  valves  of  the  kind  in  the 
Indian  plant's  cooling  system  have  a  failure  rate  of  about  one  per  ten 
thousand  hours  of  use.)  But  how  do  we  use  statistical  data  when  we  do  have 
it? 

Here  is  another  example.  I  hold  in  my  hand  a  newly  minted  coin.  I 

will  toss  it  once  and  then  it  will  be  melted  down.  What  is  the  probability 

that  this  coin  will  yield  heads  when  tossed?  The  relative  frequency  of  heads 
among  tosses  of  this  coin  is  0  or  1  -  we  have  no  statistical  knowledge  of 
the  behavior  of  this  coin.  But  we  know  that  the  toss  of  this  coin  I  am 
about  to  perform  will  yield  heads  iff  the  next  toss  of  a  coin  performed  by 
me  yields  heads  -  and  for  tosses  of  coins  in  general  we  have  lots  of 
statistical  evidence. 

3.  A  Simple  System 

Here  is  a  very  simple  example  of  how  objective  (frequency  or  chance) 
probability  can  be  applied  to  "unique  events".  It  is  essentially  due  to 
Reichenbach  (1949).  It  is  a  step  backward  from  the  discussion  of  the 
previous  section,  but  we  will  regain  our  insights  in  the  following 
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section. 


Let  R  •  be  a  finite  set  of  potential  reference  classes,  let 

P  -  {Pi-Pkl  be  a  finite  set  of  properties  (including  such  properties  as 
being  a  member  of  a  particular  reference  class),  and  let  ^  *  ^i...i^  be 
a  set  of  distinct  individuals  (or  individual  events). 

We  can  define  a  language  on  this  basis  in  the  usual  way. 

Add  to  this  language  enough  mathematics  to  do  statistics,  and  define 
an  item  of  possible  statistical  knowledge  to  be  a  sentence  of  the 
syntactical  form;  "X(rj,Ps^  ■  which  we  read:  the  proportion  of 
objects  in  the  reference  class  rj^  that  have  the  property  Pg  is  3^ 

PropcAftions  satisfy  the  classical  probability  axioms. 

Let  a  body  of  knowledge  R  be  a  set  of  sentences.  We  impose  few 
restrictions  on  K:  we  want  it  to  be  consistent  in  the  sense  that  there 
should  be  no  sentence  S  in  K  for  which  S  is  also  in  K;  we  want  some 
logical  truths  in  K  (we  could  essentially  enumerate  them);  and  we  want  the 
items  of  possible  statistical  knowledge  to  be  consistent  statements 
concerning  the  relations  of  the  possible  reference  classes  and  properties. 
(For  example,  one  would  not  want  to  have  "(x)(P^(x)  Eg,(5))"»  jfe 

^(rjjPj)"  both  in  jL  Finally,  we  want  the  (finite  number  of)  sentences  of 
the  form  '?^(i^)"  together  with  "0  ■  0"  and  "0  *  1"  to  generate  a  partition 
of  all  the  sentences  of  the  language  under  the  relation  of  having  the  same 
truth  value. 

These  constraints  are  embodied  in  the  following  axioms: 

A2.1  R  is  closed  under  intersection. 

A2.2  If  ^  ^  j,  r^  «  R,  ry  C  R,  then  r .  ■  r^  . 

A2.3  If  C  then  "rj  c.  "«  K. 

A3.1  P  is  closed  under  conjunction  and  negation. 
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A3. 2  If  j,  Pj  C  P,  Pj  €  P,  then  |/Ax(^(x)  P.x). 

A3. 3  If  |-/\i(P^(x)  ""^Pj^x))  then  ->  (jc) )"  4. 

A4  If  r^"  4  K  and  ”i„  4  _?■  4  K.  then 

A5.1  If  "S  4  K  then  "S^4->Sj^"  e  L 

A5.2  If  and  are  in  ^  then  "S^'»->S^"4  K. 

A5.3  S/'  4 

A6  For  every  non-mathematical*  sentence  in  _L,  there  exists 
a  and  exactly  one  such  that  4-_^. 

A7  There  exists  a  model  of  the  sentences  in  with  "Z(X,Y)" 
construed  as  the  proportion  of  ^'s  that  are  Jf^'s. 

We  can  now  define  the  probability  of  a  sentence  ^  relative  to  a  body  of 
knowledge  to  be  x  just  in  case  ^  is  known  in  K  to  be  equivalent  to  a 
sentence  of  the  form  —  this  is  just  to  say  that  the  biconditional 

"Sa^P^i^)”  is  in  ^  —  and  for  some  reference  class  r^to  which  known 

to  belong,  "Z(3^^,Pjt^  *  S"  finally,  if  is  another  reference 

class  to  which  i^  is  known  to  belong,  and  "Z(j^i  »E^)<>x"  is  in  then  it  is 
known  that  is  included  in  .  (This  is  just  to  say  that  'Ir^C  4^*) 
Formally: 

D1  Prob(S,K)  ■  x^ iff  there  are  P^f  and  r^  such  that 

(1)  "^^Pj(i^)"  i»  in  5.. 

(2)  £.• 

^3)  "X(jy,fP^  *  *!'  i«  in  K* 

(4)  If  "i^J4  Iju*"  in^»  *nd  ■  x"  is  in  K, 

then  c  ”  is  in  K. 

^us  £jLs  the  smallest  reference  class  about  which  we  have  statistical 
information  to  which  i,.  is  known  to  belong.  This  is  essentially 
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Reichenbach' s  idea  except  for  the  addition  of  axiom  Al. 


We  can  generate  the  probability  more  clearly  by  putting  the  fourth 
condition  as  a  constraint  on  a  table.  Let  the  first  column  of  the  table 
contain  a  list  of  all  the  reference  classes  to  which  i^  is  known  to 
belong.  Let  the  second  column  contain  the  value  of  x(r^)  from  the 
corresponding  item  of  possible  statistical  knowledge:  *  x(l^)." 

Work  down  the  table,  deleting  every  row  that  fails  condition  (4).  (Rule:  if 


)  /  *(  fui  )f  delete  both  rows  unless  "ru.  c  ^  There  may 

- -  "  "'4  “ 

be  several  rows  left,  but  they  will  all  mention  the  same  value  of  x.  There 


may  be  no  rows  left. 


4.  Limitations. 

This  approach  deals  perfectly  reasonably  with  tosses  of  coins  and  the 
like.  It  also  does  what  we  want  for  Sam,  in  the  case  in  which  he  decides 
whether  or  not  to  go  to  the  movies  by  tossing  a  coin.  But  it  has  serious 
drawbacks.  It  fails  to  provide  for  the  case  in  which  we  get  the  probabilty 
of  Sam  going  to  the  movies  from  a  limited  statistical  basis.  It  gives  us  no 
probability  at  all  when  we  know  of  i^  that  it  belongs  to  two  reference 
classes,  our  knowledge  about  those  reference  classes  doesn't  agree,  and  we 
don't  know  that  either  reference  class  is  included  in  the  other.  Example: 

I  know  that  chemistry  majors  go  to  the  movies  half  the  time  on  week  nights;  I 
know  that  computer  science  majors  go  to  the  movies  lOZ  of  the  time;  Sam  is  a 
double  major  in  both  computer  science  and  chemistry;  but  I  have  no  knowledge 
of  how  frequently  such  double  majors  go  to  the  movies. 

The  remedy  is  simple  and  obvious,  but  it  entails  considerable 
complication.  We  allow  items  of  possible  statistical  knowledge  to  embody 


approx inuite  knowledge.  To  stick  to  one  syntscical  form,  let  us  write 

to  mean  that  the  proportion  of  objects  (or  the  chance  of  an  object)  in  the 
reference  class  r^  having  the  property  is  in  the  closed  interval 

Suddenly  we  have  statistical  knowledge  about  every  property  and  every 
reference  class:  at  the  very  least  we  will  know  that  the  proportion  lies  in 
[o,i].  And  now  what  do  we  mean  by  "  <>  "?  These  changes  workt  Say  that 
two  intervals  "differ"  if  neither  is  included  in  the  other,  and  rewrite  (4) 
to  say  that  if  differ,  then  r.^is  known  to  be  included  in  : 

(4')  If  "i^»  r^i"  is  in  K  and  "Z(r^,  ,Pj)  4  j^j  ,x'^  "  is  in  K, 
and  ,x;^]  differs  from  ,  then  "r^Cr^/"  is  in 

And,  finally,  we  must  include  another  clause  to  single  out  for  us  the  most 
informative  interval  that  is  not  ruled  out  be  conflict  with  another 
interval: 

(5)  If  X|j'  ^es  not  been  eliminated  as  a  possible  reference  class 
by  the  earlier  conditions,  then  the  interval  corresponding  to 
is  a  subinterval  of  the  interval  corresponding  to  r^j. 

This  definition  of  probability  is  still  limited  —  it  turns  out  that  we 
would  like  to  other  relations,  in  addition  to  the  subset  relation,  to  excuse 
"difference".  And  we  would  like  to  to  able  to  consider  equivalence  to 
statements  concerning  several  different  individuals.  (A  general  definition 
along  these  lines  is  provided  in  Kyburg  1985.)  But  it  is  already  quite 
powerful,  and  it  has  some  rather  interesting  properties: 

(1)  All  probabilities  are  objective,  in  the  sense  that  each 
probability  is  based  on  empirical  knowledge  about  frequencies  or  chances  in 
the  world. 

(2)  Every  statement  in  the  language  has  a  probability:  there  is  no 
distinction  between  statements  concerning  "repeatable"  events  and  statements 
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concerning  "unique"  events. 

(3)  ^  a  priori  probabilities  are  required;  all  probabilities  can 

be  based  on  experience.  (But  hov  they  can  be  so  based  is  another  story.) 

5. Conclusion 

Probabilistic  knowledge  may  be  regarded  as  all  of  a  piece.  There  is  no 
need  to  distinguish  between  "statistical"  probabilities  that  have  objective 
warrant  in  the  world,  and  "subjective"  probabilities  that  merely  reflect  our 
subjective  feelings.  When  we  apply  our  knowledge  of  statistical  facts  to 
individual  cases, (as  we  always  do  when  it  comes  to  decision  theory)  it  is 
the  probability  of  a  unique  event  that  is  at  issue  (che  next  toss).  When  we 
offer  a  "subjective"  probability  for  a  unique  event,  it  is,  if  it  has  any 
epistemological  justification  at  all,  based  on  some  (possibly  approximate) 
statistical  knowledge.  The  difference  between  the  two  cases  lies  in  the 
fact  that  in  the  former  case  it  is  easy  to  specify  the  reference  class  —  it 
may  even  be  built  into  the  problem  through  the  use  of  the  indefinite  articles 
"a"  and  "an"  —  and  in  the  latter  case,  it  may  be  quite  difficult  to  put 
your  finger  on  the  reference  class.  But  this  is  a  difference  of  degree,  and 
not  of  kind;  and  the  procedures  suggested  here,  and  in  more  detail  in 
(Ryburg  1985)  can  render  both  kinds  computable  within  quite  rich  languages. 
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Note: 

*  A  non-mathematical  sentence  is  one  whose  truth-value  does  not  depend  on 
the  truth  of  a  mathematical  statement.  Thus  the  conjunction  (or 
disjunction)  of  "Heads  on  toss  #7"  and  Fermat's  last  theorem  is  a 
mathematical  sentence,  and  so  is  "the  toss  indexed  by  the  fourth  prime 
yields  heads."  But  the  latter  is  equivalent  to  "Toss  number  seven  yields 
heads."  The  point  is  to  avoid  worrying  about  undecidable,  and  even  semi- 
decidable,  sentences. 
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